A naive Bayes classifier on 1998 KDD Cup
Abstract
The 1998 KDD Cup provides a large dataset whose many features can be used to predict which recipients are likely to respond to a mailing. Our goal is to show that the naive Bayes classifier may be accurate enough to select the people who will reply. We use cross-validation to establish a baseline for expected performance. We also analyze the space and time complexity of the classifier and compare them with the theoretical bounds of the naive Bayes algorithm.
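As a rough illustration of the approach the abstract describes, the following is a minimal sketch of a categorical naive Bayes classifier evaluated with k-fold cross-validation. It is not the paper's implementation: the function names (`train_nb`, `predict_nb`, `cross_validate`), the Laplace smoothing parameter `alpha`, and the tiny synthetic dataset are all assumptions made for illustration, and the feature values are invented rather than taken from the KDD Cup 1998 data.

```python
import math
import random
from collections import Counter, defaultdict


def train_nb(rows, labels):
    """Count class and per-feature value frequencies in one pass over the data."""
    class_counts = Counter(labels)
    value_counts = defaultdict(Counter)  # (feature index, class) -> value counts
    vocab = defaultdict(set)             # feature index -> distinct values seen
    for row, y in zip(rows, labels):
        for j, v in enumerate(row):
            value_counts[(j, y)][v] += 1
            vocab[j].add(v)
    return class_counts, value_counts, vocab


def predict_nb(model, row, alpha=1.0):
    """Return the class with the highest smoothed log-posterior score."""
    class_counts, value_counts, vocab = model
    total = sum(class_counts.values())
    best, best_score = None, -math.inf
    for y, n_y in class_counts.items():
        score = math.log(n_y / total)  # log prior
        for j, v in enumerate(row):
            # Laplace smoothing (an assumption here) avoids log(0) for unseen values.
            num = value_counts[(j, y)][v] + alpha
            den = n_y + alpha * len(vocab[j])
            score += math.log(num / den)
        if score > best_score:
            best, best_score = y, score
    return best


def cross_validate(rows, labels, k=5, seed=0):
    """Mean accuracy over k folds, each fold held out once for testing."""
    idx = list(range(len(rows)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    accuracies = []
    for held_out in folds:
        held = set(held_out)
        train_idx = [i for i in idx if i not in held]
        model = train_nb([rows[i] for i in train_idx],
                         [labels[i] for i in train_idx])
        hits = sum(predict_nb(model, rows[i]) == labels[i] for i in held_out)
        accuracies.append(hits / len(held_out))
    return sum(accuracies) / k


# Synthetic toy data (hypothetical, NOT KDD Cup 1998 fields): the first
# feature determines the label, the second is uninformative noise.
rows, labels = [], []
for i in range(20):
    donor = "recent" if i < 10 else "lapsed"
    noise = "x" if i % 2 == 0 else "y"
    rows.append((donor, noise))
    labels.append("respond" if donor == "recent" else "ignore")

accuracy = cross_validate(rows, labels, k=5)
print(f"mean cross-validated accuracy: {accuracy:.2f}")
```

In this sketch, training is a single pass over the data, so its time grows with the number of rows times the number of features, and the count tables grow with the number of classes, features, and distinct feature values. That is the kind of bound an empirical complexity analysis can be compared against.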
Similar resources
An Ensemble of Three Classifiers for KDD Cup 2009: Expanded Linear Model, Heterogeneous Boosting, and Selective Naive Bayes
Using Text Classification to Predict the Gene Knockout Behaviour of S. Cerevisiae
A naive Bayes classifier was used to analyze gene behavior based on text data and presented as an entry for the 2002 KDD Cup, a data mining exercise to predict the behavior of the yeast S. cerevisiae. The solution presented was based on the multinomial event model for text classification (McCallum & Nigam 1998) with a feature selection mechanism added. Despite this simple model, performance clos...
A K-Means and Naive Bayes learning approach for better intrusion detection
Intrusion Detection Systems (IDS) have become an important building block of any sound defense network infrastructure. Malicious attacks have brought more adverse impacts on the networks than before, increasing the need for an effective approach to detect and identify such attacks more effectively. In this study two learning approaches, K-Means Clustering and Naïve Bayes classifier (KMNB) are u...
A New Approach for Text Documents Classification with Invasive Weed Optimization and Naive Bayes Classifier
With the fast increase of the documents, using Text Document Classification (TDC) methods has become a crucial matter. This paper presented a hybrid model of Invasive Weed Optimization (IWO) and Naive Bayes (NB) classifier (IWO-NB) for Feature Selection (FS) in order to reduce the big size of features space in TDC. TDC includes different actions such as text processing, feature extraction, form...
Network Intrusion Detection Using a Hidden Naïve Bayes Binary Classifier
Using data mining techniques in intrusion detection systems is common for the classification of the network events as either normal events or attack events. Naïve Bayes (NB) method is a simple, efficient and popular data mining method that is built on conditional independence of attributes assumption. Hidden Naïve Bayes (HNB) is an extended form of NB that keeps the NB's simplicity and efficien...
Publication date: 2006